FBG Model Based Low Rate Coding of Speech
نویسندگان
چکیده
represented by a set of fbg filters each corresponding to a resonance peak where f is the formant frequency, b the 3 dB bandwidth and g the gain of a resonant peak. Speech coding algorithms are basic components in existing and future personal communication systems. Reducing the bit rate in transmitting speech is by way a matter of using slowly varying and numerical robust parameters to represent the speech. We are here focusing on parameters close to the physics of the speech production system for modelling the vocal tract. We have studied the dynamics (time variation) and numerical robustness of the new parameters (f formant frequency, b 3 dB bandwidth, g gain) and compared them with the traditional reflection coefficients. Listening tests indicate a possible reduction in bit rate of about two to three times for comparable sound quality. These fbgparameters will be coded, transmitted, interpolated and used as input into the receiver model in order to reconstruct the speech signal. Time(msec.) F re qu en cy (H z) red: 0, orange: −20, yellow: −40, green: −50
منابع مشابه
Syllable-based pitch encoding for low bit rate speech coding with recognition/synthesis architecture
Current HMM-based low bit rate speech coding systems work with phonetic vocoders. Pitch contour coding (on frame or phoneme level) is usually fairly orthogonal to other speech coding parameters. We make an assumption in our work that the speech signal contains supra-segmental cues. Hence, we present encoding of the pitch on the syllable level, used in the framework of a recognition/synthesis sp...
متن کاملProgress Report of a Project in Very Low Bit-rate Speech Coding
Background work in various levels of speech coding is reviewed, including unconstrained coding and recognition-synthesis approaches that assume the signal is speech. A pilot project in HMM-TTS based speech coding is then described, in which a comparison with harmonic plus noise modelling is also done. Results of the demonstration project including samples of speech under various transmission si...
متن کاملSpectral enhancement preprocessing for the HNM coding of noisy speech
Low rate coders based on the harmonic-noise model are sensitive to acoustic background noise at low SNRs due to the increase in parameter errors from the analysis of noisy speech. We investigate the use of spectral subtraction enhancement preprocessing on the performance of the sinusoidal model based codec both by objective assessment of parameter errors and the subjective testing of output spe...
متن کاملA new approach to modeling excitation in very low-rate speech coding
A new method for two-band approximation of excitation signals in an LPC model, to improve speech naturalness in very low rate coding, is proposed. Based on a simpli ed model of Multi-Band Excitation, the method accurately determines the degree of periodicity, using the concept of Instantaneous Frequency (IF) estimation in frequency domain. The harmonic structure in the spectrum of LPC residual,...
متن کاملA Wavelet-Packet Based Speech Coding Algorithm1
The trend toward real-time, low-bit-rate speech coders dictates current research efforts in speech compression. Such coders are desirable for a number of applications including transmission of digital speech signals and multimedia applications. Multimedia and video conferencing, dynamic web-site access with voice and video introduces the idea of using voice over the Internet. This idea also ope...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2002